Lexical diversity
Computing the Lexical Diversity of Tweets
A slightly more advanced measurement that involves calculating simple frequencies and can be applied to unstructured text is a metric called lexical diversity.
Lexical diversity is an interesting concept in the area of interpersonal communications because it provides a quantitative measure for the diversity of an individual's or group's vocabulary.
The speaker who repeatedly says "and stuff" would have a lower lexical diversity than the speaker who uses a more diverse vocabulary, and chances are reasonably good that you'd walk away from the conversation feeling as though the speaker with the higher lexical diversity understands the subject matter better.
As applied to tweets or similar online communications, lexical diversity can be worth considering as a primitive statistic for answering a number of questions, such as how broad or narrow the subject matter is that an individual or group discusses.
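The passage does not spell out a formula. A common way to compute lexical diversity is the ratio of unique tokens to total tokens, and the sketch below assumes that definition. The function names, the naive lowercase whitespace tokenizer, and the sample tweets are all hypothetical, chosen only for illustration.

```python
def tokenize(text):
    """Naive tokenizer (an assumption): lowercase, then split on whitespace."""
    return text.lower().split()


def lexical_diversity(tokens):
    """Return unique tokens divided by total tokens; 0.0 for empty input."""
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


# Hypothetical sample tweets, not real data.
tweets = [
    "I love pizza and stuff",
    "Pizza and stuff and stuff",
]

# Pool the tokens from all tweets, then measure the pooled vocabulary.
tokens = [token for tweet in tweets for token in tokenize(tweet)]
diversity = lexical_diversity(tokens)
print(f"{len(set(tokens))} unique / {len(tokens)} total = {diversity:.2f}")
```

A value near 1.0 means almost every token is distinct, while a value near 0.0 means heavy repetition. Real tweets would likely need a tokenizer that handles punctuation, hashtags, mentions, and URLs before the ratio says much.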